Scalable Graph Hashing with Feature Transformation
نویسندگان
چکیده
Hashing has been widely used for approximate nearest neighbor (ANN) search in big data applications because of its low storage cost and fast retrieval speed. The goal of hashing is to map the data points from the original space into a binary-code space where the similarity (neighborhood structure) in the original space is preserved. By directly exploiting the similarity to guide the hashing code learning procedure, graph hashing has attracted much attention. However, most existing graph hashing methods cannot achieve satisfactory performance in real applications due to the high complexity for graph modeling. In this paper, we propose a novel method, called scalable graph hashing with feature transformation (SGH), for large-scale graph hashing. Through feature transformation, we can effectively approximate the whole graph without explicitly computing the similarity graph matrix, based on which a sequential learning method is proposed to learn the hash functions in a bit-wise manner. Experiments on two datasets with one million data points show that our SGH method can outperform the state-of-the-art methods in terms of both accuracy and scalability.
منابع مشابه
Image authentication using LBP-based perceptual image hashing
Feature extraction is a main step in all perceptual image hashing schemes in which robust features will led to better results in perceptual robustness. Simplicity, discriminative power, computational efficiency and robustness to illumination changes are counted as distinguished properties of Local Binary Pattern features. In this paper, we investigate the use of local binary patterns for percep...
متن کاملScalable Image Annotation by Summarizing Training Samples into Labeled Prototypes
By increasing the number of images, it is essential to provide fast search methods and intelligent filtering of images. To handle images in large datasets, some relevant tags are assigned to each image to for describing its content. Automatic Image Annotation (AIA) aims to automatically assign a group of keywords to an image based on visual content of the image. AIA frameworks have two main sta...
متن کاملHash2Vec, Feature Hashing for Word Embeddings
In this paper we propose the application of feature hashing to create word embeddings for natural language processing. Feature hashing has been used successfully to create document vectors in related tasks like document classification. In this work we show that feature hashing can be applied to obtain word embeddings in linear time with the size of the data. The results show that this algorithm...
متن کاملNew approaches for representing, analyzing and visualizing complex kinetic mechanisms
Complex kinetic representations involving thousands of reacting species and tens of thousands of reactions are currently required for the rational analysis of modern combustion systems. In order to represent, analyze and visualize effectively the ignition processes advanced computational techniques will be required. Recently, we introduced a novel concept that captured the principal elemental t...
متن کاملScalable Data Parallel Object Recognition Using Geometric Hashing on Cm-5
In this paper, we present scalable parallel algorithms for object recognition using geometric hashing. We deene an abstract model of CM-5. We develop a load-balancing technique that results in scalable processor-time optimal algorithms for performing a probe on the CM-5 model. Given a model of CM-5 with P PNs and a set S of feature points in a scene, a probe of the recognition phase can be perf...
متن کامل